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(54) Title: IMPROVED METHOD AND APPARATUS FOR PROTECTING PUBLIC KEY SCHEMES FROM TIMING AND FAULT 
ATTACKS 

(57) Abstract 

Improved methods and apparatus for protecting pub- 
lic key schemes based on modular exponentiation (including 
RSA and Diffie-Heltman) from indirect cryptanalytic tech- 
niques such as timing and fault attacks. Known methods for 
making the implementation of number-theoretic schemes re- 
sistant to such attacks typically double their running time, 
whereas the novel methods and apparatus described in this 
patent add only negligible overhead. This improvement is 
particularly significant in smart card and software-based im- 
plementations, in which the modular exponentiation opera- 
tion is quite slow, and doubling its time may be an unac- 
ceptable solution. 
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IMPROVED METHOD AND APPARATUS FOR PROTECTING 
PUBLIC KEY SCHEMES FROM TIMING AND FAULT ATTACKS 

FIELD OF INVENTION 

The present invention relates to novel techniques, methods and apparatus, 

for making number-theoretic public key schemes (including encryption schemes, 

signature schemes, identification schemes, key management schemes, etc.) resistant to 

timing and fault attacks. 

BACKGROUND OF INVENTION 

L Introduction . 

The simplest attack on a given cryptosystem is to exhaustively search for 
the key. There are many variants of this attack (known ciphertext, known cleartext, 
chosen cleartext, etc.), but they are all based on a procedure which tries the keys one by 
one until the correct key is encountered. If the key consists of n random bits, the 
expected running time of this procedure is 2^(n-l). This attack can be easily foiled by 
using a large enough n (e.g., n>100). 

To attack cryptosystems with large keys, cryptanalysts try to find 
mathematical or statistical weaknesses which reduce the size of the search space 
(preferably to 1). Although many techniques and results are classified for national 
security reasons, it is safe to assume that it is increasingly difficult to find such 
weaknesses in modern schemes designed by experienced cryptographers and 
implemented on high speed microprocessors. 

To successfully attack strong cryptosystems, the ciyptanalyst must use 
indirect techniques. This is best done when the ciyptanalyst is either in close physical 
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proximity to the ciyptographic device, or has it under his complete control. The 
cryptographic device is assumed to be a black box which contains a known algorithm and 
an unknown key. The cryptanalyst cannot open this box and read its key, but he can 
observe its behavior under various circumstances. 

One of the best known examples of such an indirect attack is TEMPEST, 
which tries to deduce the key by analyzing electromagnetic radiation emanating from the 
black box during the computation of the ciphertext. Techniques for applying and 
preventing such attacks have been extensively studied for more than 50 years, and by 
now this is a well understood problem. 

Two powerful indirect attacks were discovered and published recently: In 
December 1995, P. Kocher, "Oyptanalysis of Diffie-Hellman, RSA, DSS, and Other 
Systems Using Timing Attacks," technical report, 12/7/95, described a timing attack, and 
in September 1996, D. Boneh, R.A. Demillo and R.J. Iipton, "On the Importance of 
Checking Computations," technical report, 9/25/96 (an extended version appears in the 
Proceedings of Eurocrypt 97, May 1997) described a fault attack. Both attacks were 
originally designed for and are most successful against public key schemes based on 
number theoretic principles, such as RSA, but they were later extended to classical 
ciyptosystems as well (e.g., by E. Biham and A. Shamir, "A New Cryptanalytic Attack on 
DES," technical report, 10/18/96. An extended version appears in the Proceedings of 
Crypto 97, August 1997). 

Such attacks are particularly useful when the scheme is implemented on a 
smart card, which is distributed by a bank, computer network, cellular phone operator, or 
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pay-TV broadcaster to its customers. Hackers do not usually have the financial and 
technical resources required to read the contents of the key registers inside the smart 
card, but they have complete control on the input/output, clock, reset, and power 
connections of the smart card. They can carefully measure the duration of the various 
operations, how much power they consume, what happens when the computation is 
interrupted or carried out under abnormal operating conditions, etc. Since the tests are 
carried out in the privacy of the customer's home, the card manufacturer cannot prevent 
them or even learn about their existence. 
2. Timing Attacks . 

Timing attacks are based on the assumption that some of the basic 
operations carried out during the cryptographic calculation require a non-constant 
amount of time which depends on the actual values being operated upon. This. implies 
that some information about these unknown intermediate values leaks out by measuring 
the length of the cryptographic computation. If these intermediate values are computed 
from known cleartext bits and unknown key bits by a known cryptographic algorithm, 
the attacker can try to use the leaked intermediate values to deduce the key. 

The main difficulty in carrying out this attack is that the attacker knows 
only the total amount of time required to carry out the cryptographic computation, but 
not the timing of individual computational steps. Kocher's main contribution is in 
developing an efficient technique for handling this difficulty in many cases of practical 
interest. 

For the sake of concreteness, we describe Kocher's attack on the RSA 

3 



BNSDOCID: <WO 9S5231 9A1 _l_> 



PCT/US98/09593 

cryptosystem. The black box is assumed to contain a publicly known modulus n and a 
secret exponent d. Given an input number x, the box performs the modular 
exponentiation x ^ d (mod n) by using the standard square-and-multiply technique. In 
this description, the symbol " ^ " is exponentiation and the symbol " is a subscript. The 
result (which can be the decryption of the ciphertext x, the signature of the message x, or 
the response to a random identification challenge x, depending on the application) is 
sent out as soon as it is produced, and thus the attacker can measure the total number of 
clock cycles taken by all the modular multiplications. 

Standard implementations of modular multiplication require a non-constant 
amount of time, since they skip multiplication steps involving leading zeroes, and 
reduction steps when the result is smaller than the modulus. The attacker chooses a 
large number of random inputs x, and measures the actual timing distribution T_0 of the 
modular exponentiation operation carried out by the black box. He then measures for 
each x (by computer simulation, using his knowledge of how the scheme is implemented) 
the precise timing of an initial square-only operation, and separately, the precise timing 
of an initial square-and-multiply operation. The result is a pair of timin g distributions 
T_l and T_2, which are not identical. All the cryptographic computations carried out in 
the black box use the same exponent d, and its first bit determines which one of the two 
computed distributions T_l and T_2 is the initial part of the experimentally computed 
T_0. The timing of the remaining steps of the computations can be assumed to be a 
random variable R, which is normally distributed and uncorrelated with either T_l or 
T_2. Since T_0 is either TJ.+R or T_2+ R, the attacker can decide which case is more 
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likely by finding which one of the two distributions T_0-T_1 and T_0-T_2 has a lower 
variance. 

After finding the first bit of the secret exponent d, the attacker knows the 
actual inputs to the second computational step, and thus he can apply the same 
technique (with properly modified experimental and simulated timing distributions T^O, 
T_l f and T_2) to find the second bit of d. By repeating this procedure about 1000 
times, he can compute all the bits of d, and thus break the RS A scheme. 

A similar timing attack can be applied to any cryptographic scheme in 
which the black box raises all its inputs x_l,x_2,... to the same secret power d modulo 
the same known n (which can be either a prime or a composite number). For example, 
in one of the variants of the Diffie-Hellman key distribution scheme, all the users agree 
on a prime modulus n and on a generator g of the multiplicative group Z ^ * ja. i 

Each user chooses a random secret exponent d, and computes y=g^d v ^ 

(mod n) as his public key. To establish a common secret key with another user, the first 
user sends out his public key y=g~d (mod n), and receives a similarly computed public ^ . 
key x=g~e (mod n) from the other user. Their common cryptographic key is 
z=g^ (d*e) (mod n) which the first user computes by evaluating x^d (mod n). When 
the first user communicates with several parties, he raises several known values 
xJL,x_2,...to the same secret power d modulo the same known modulus n. By measuring 
the timing of sufficiently many such computations, the attacker can determine d and thus 
find all the cryptographic keys z_l,z_2,.~ employed by that user. 

The timing attack has to be modified if the computation of x ^ d (mod n) 
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for a composite modulus n=p*q is carried out by computing x^d (mod p), x^d (mod 
q), and combining the results by the Chinese Remainder Theorem (CRT). This is a 
common way of making the computation about 4 times faster when the factorization of n 
is known. The problem for the attacker is that he does not know the secret factors p 
and q of the public modulus n, and thus cannot simulate the timing distributions T_l and 
TJL Kocher's solution is to concentrate on the first step of the CRT computation, in 
which the input x is reduced modulo p. If x is smaller than p, no modular reduction is 
required, and thus the computation is considerably faster than when x is larger than or 
equal to p. The attacker thus presents to the black box a large number of inputs x which 
are very close to each other, and uses the average time of such computations to decide 
whether these x"s are above or below p. A decision procedure for this question can be 
repeatedly used to find the precise value of p by binary search. 

Shortly after the discovery of this attack, researchers tried to develop 
implementations which are immune to it. The simplest idea is to make sure that all the 
cryptographic operations take exactly the same amount of time, regardless of the values 
of the cleartexts and keys. However, achieving this is surprisingly difficult for the 
following reasons: 

(a) In many cases, the implementor wants to run the same algorithm in 
software on different (and perhaps unknown) machines. An 
implementation which is constant time on one microprocessor may 
be variable time on another microprocessor or even on an enhanced 
version of the same microprocessor. 
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(b) On a multitasking machine the running time may depend on the 
amount of free memory, the cache hit rate, the number of external 
interrupts, etc. This can change a constant time implementation 
under one set of circumstances into a variable time implementation 
under another set of circumstances. 

(c) If the implementor tries to use a real time clock to force all the 
computations to take the same amount of time, he must slow all of 
them down to their worst cases. Since he cannot use any input- 
dependent optimization technique, the implementation is likely to 
be unacceptably slow. 

The best protective technique proposed so far against Kocher's timing 
attack on modular exponentiation is to replace each input x by a modified version y=x*r 
(mod n) where r is a secret random number between 1 and n-1. To compute x^d (mod 
n), the program computes y^d (mod n) and r^d (mod n), and then uses the 
multiplicative property of modular exponentiation to compute x ^ d (mod n) as y ^ d/r ~ d 
(mod n). Since both y and r are unknown, the attacker cannot simulate these 
computations in order to find the successive bits of d in the non-CRT computation, and 
cannot perform binary search in the CRT version of the computation* Unfortunately, 
this randomization technique doubles the expected running time of the computation. 
3. Fault Attacks . 

Fault attacks try to introduce errors into the cryptographic computation, 
and to identify the key by analyzing the mathematical and statistical properties of the 
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erroneously computed results. . Among the many techniques suggested so far for 
introducing such errors are the use of ionizing radiation, unusual operating temperatures, 
power and clock glitches, and laser-based chip microsurgery. Some of the attacks are 
differential (i.e., they carry out both a correct and an erroneous computation with the 
same input and analyze their differences), while other attacks just use the erroneous 
results. 

The original fault attack on public key ciyptosystems was described in 
Boneh, Demillo and Lipton, and required several cryptographic computations. We now 
describe an improved version of this attack, due to Arjen Lenstra, which requires a single 
faulty computation. We assume that the black box uses the RSA scheme to sign a given 
message x. The computation of x ^ d (mod n) is carried out with the CRT method by 
first reducing x modulo p and q to get x_l and x_2, then computing y_l=x_l ~d (mod p) 
and y_2=x_2 ~ d (mod q), and finally combining y_l and y_2 to get the signature y (mod 
n) with the CRT method. We assume that a single error is introduced at a random time 
during this computation by applying mild physical stress to the black box. Without loss 
of generality, we can assume that the error was introduced during the computation of 
x_l ~ d (mod p), and thus instead of getting the correct y_l, the box computed an 
erroneous y'_l. When y*_l and y_2 are combined by the CRT method, the box 
computes an incorrect signature y* which is provided to the attacker. 

The main observation is that the attacker knows the signature verification 
exponent e, for which y ~e=x (mod n). Due to the error, / ^e-x is non-zero mod p, but 
zero mod q, and thus it is a multiple of q which is not a multiple of n. The attacker can 
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thus factor n by computing the greatest common divisor of y' ^ e-x (mod n) and n, which 
is an easy computation. 

To protect cryptographic schemes against fault attacks, Boneh, DemiUo and 
Lipton recommend that each computation should be carried out twice (preferably by 
different algorithms). If any discrepancy is found between the two results, the box 
should not output anything. This provides strong protection from random faults (which 
are unlikely to affect the two computations in an identical way), but it slows down the 
computation by a factor of 2. Such a slowdown is particularly noticeable in smart card 
implementations of public key schemes, which are quite slow to begin with. 

SUMMARY OF THE INVENTION 
The present invention relates to method and apparatus for providing protective 
techniques for public key schemes, which provide strong protection against the described 
timing and fault attacks without incurring the twofold slowdown made necessary by the 
previously known protective techniques. 

The first technique is designed to protect non-CRT implementations of 
public key schemes against timing attacks. It is applicable to the RSA ciyptosystem, 
RSA digital signature scheme, Diffie-Hellman key distribution scheme, and any other 
number theoretic scheme in which the black box raises a known input x to a fixed secret 
exponent d modulo a public n whose factorization is known to the black box. 

The second technique is designed to protect CRT-based implementations 
of public key schemes from both timing and fault attacks. The main problem is how to 
verify the correctness of the computations of x_l * d (mod p) and xJ2 ~ d (mod q) 

9 
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without repeating them a second time (or verifying each step separately, which again 
doubles the running time). The invention provides a novel error-detection technique for 
such number theoretic computations which is much more efficient. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 shows schematically the method and apparatus of the invention as it relates 
to a first technique designed to protect non-CRT implementations of a public key 
scheme against timing attacks. 

Fig. 2 shows schematically the method and apparatus of the invention as it relates 
to a second technique designed to protect CRT-based implementations of a public key 
scheme against both timing and fault attacks. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Referring now to the drawings, preferred embodiments will now be described in 
detail. Since public key schemes and computer hardware and software implementation 
are well known to those of skill in the art, no description of same is deemed necessary to 
a full, concise and exact understanding of the present invention. 

We now describe two novel protective techniques for public key schemes, which 
provide strong protection against the described timing and fault attacks without incurring 
the twofold slowdown made necessary by the previously known protective techniques. 

The first technique, shown in Fig. 1, is designed to protect non-CRT 
implementations of public key schemes against timing attacks. It is applicable to the 
RSA cryptosystem, RSA digital signature scheme, Diffie-Hellman key distribution 
scheme, and any other number theoretic scheme in which the black box 10 raises a 

10 
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known input x to a fixed secret exponent d modulo a public n whose factorization is 
known to the black box. 

The main observation is that for each n there exists a number t=phi(n), 
box 12, such that for any x between 1 and n which is relatively prime to n, x~t=l (mod 
n). This phi is called Euler's totient function: when n is prime, phi(n)=n-l, and when 
n=p*q, phi(n) = (p-l)*(q-l). The implementation can thus replace the computation of 
x^d (mod n) by the computation of x~(d+i*t) (mod n) for any integer i without 
changing the computed result, since x /N (d+it)=(x /s d)*(x^ v t) ~i = (x~d)*l ^i=x~d 
(mod n), box 14. 

This equality can be shown to be true even when x is not relatively prime 
to n, but this case is unlikely to arise in practice. Note also that any integral multiple of 
GCD((p-l), (q-1)) can replace phi(n) in our choice of L 

The proposed protection technique for public key schemes based on the 
modular exponentiation operation is to choose a new random secret i in each 
computation, and to replace the computation of x^d (mod n) by the computation of 

(d+i*t) (mod n) where t is the precomputed value of phi(n). Since the bits of these 
(d-hi*t) for the various i are different, each exponentiation uses a different sequence of 
square-and-multiply steps, and thus the attacker cannot use Kocher's timing attack to 
analyze the timing distribution of several exponentiations, even though all of them 
compute the same d-th power of their inputs. 

The efficiency of this technique is based on the fact that t is always smaller 
than n, and the randomizing element i can be chosen as a relatively small number. If n 
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and d are 1024 bit numbers, and i is a random 32 bit number (which are the currently 
recommended sizes), d+i*t is a 1056 bit number. The process of raising the input x to 
the 1056 bit power d+ i*t requires only 3% more square-and-multiply operations than the 
process of raising x to the original 1024 bit power d. This is much better than the 
alternative randomization technique described earlier, which doubles the running time. 

The second technique, shown in Fig. 2, is designed to protect CRT-based 
implementations of public key schemes from both timing and fault attacks. The main 
problem is how to verify the correctness of the computations of x_l ^d (mod p) and 
x_2~d ( mod q) without repeating them a second time (or verifying each step separately, 
which again doubles the running time). We now describe a novel error-detection 
technique for such number theoretic computations which is much more efficient. 

In each computation, the black box 20 chooses a new random integer j (the 
recommended size of j is also 32 bit), box 22. Instead of computing x_l=x (mod p) and 
x_2=x (mod q) followed by y_l=x_l~d (mod p) and y_2=x_2~d (mod q), the box 
computes v_l=x (mod j*p), v_2=x (mod j*q), d_l=d (mod phi(j*p)), and dJ2=d (mod 
phi(j*q)), box 24, followed by wJL=v_l~d_l (mod j*p) and w_2=v_2~d_2 (mod j*q), 
box 26. 

The main observation is that from w_l and w_2 it is easy to derive y_l and 
y_2 by further reductions (namely, y_l=w_l (mod p) and y_2=w_2 (mod q)), box 28, 
and thus it is easy to compute the final result y by the Chinese remainder Theorem, box 
30. However, we can also obtain the value of x^d (mod j) in two different ways: as 
w_l (mod j) and as w_2 (mod j), box 32. We can now use the equality of these two 
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values (which were obtained from the two halves of the computation, and mixed with the 
derivation of y_l, y_2 in a very strong way), box 34, as a test of correctness: In a 
faultless computation the two values will always be the same, whereas in a faulty 
computation (with random faults) the probability that the two values will be the same is 
about 1/(2^32), see decision box 36 in which an abort is commanded for a faulty 
computation- This error detection technique is thus sufficient for any application in 
which the total number of modular exponentiations is significantly smaller than 2^32 
(about 4 billion). 

The overall time complexity of this implementation is higher by a few 
percent than the time complexity of standard implementations, since the exponentiations 
are carried out modulo 5124-32=544 bit moduli j*p and j*q instead of 512 bit moduli p 
and q. However, this is much faster than repeating each exponentiation a second time to 
verify its correctness. 

An additional benefit of this randomization technique is that it also 
provides protection from timing attacks at no extra cost. Kocher's original attack on 
CRT-based implementations concentrates on the initial modular reduction (mod p), and 
uses binary search to find increasingly accurate approximations of p from multiple 
computations. 

By using our proposed technique, each computation uses a different modulus j*p 
in its initial reduction step, and thus the attacker cannot refine his knowledge of the 
modulus by analyzing a large number of exponentiations. 

There are many optimizations and variations of this technique, which 
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should be obvious to anyone skilled in the art. For example, it is possible to impose 
additional restrictions (such as primality) on the choice of the small multiplier j which 
make it somewhat less likely that faulty computations will remain undetected. Another 
modification of the technique is to test each half of the computation by a separate 
recomputation modulo a different small modulus, rather than by comparing the two 
results modulo a common small modulus. More precisely, the implementation can 
choose two small numbers j_l and j_2, and then compute the following quantities: 



dJL=d (mod phiQ_l*p)), d_2=d (mod phi(j_2*q)), d_3 = d (mod phi(j_l)), 
d_4=d (mod phi(j_2); 

w_l=v_l~d_l (mod j_l*p), w_2=v_2~d_2 (mod j_2*P), w_3=v_3 ~ d_3 
(mod w_4=v_4~d_4 (mod j_2). 

To check the correctness of the computation, the black box verifies that 



w_l=w_3 (mod and w_2=w_4 (mod j_2). The only expensive operations are the 
computation of w_l and w_2, since the small exponentiations in the computation of w_3 
and w_4 are very efficient. This recomputation technique is slower than the original 
comparison technique, but may be slightly more resistant to certain types of non-random 
faults. 

While the invention has been described with respect to certain embodiments 
thereof, it will be appreciated by one skilled in the art that variations and modifications 
may be made without departing from the spirit and scope of the invention. 



v_l=x (mod j_l*p), v_2=x (mod j_2* 
L2); 



q), v_3=x (mod j_JL), v_4=x (mod 
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WHAT IS BEING CLAIMED IS 

1. In a method of implementing public key schemes containing the 
non-CRT form of the modular exponentiation operation x^d (mod n), for the purpose 
of making them more resistant to timing attacks, the improvement comprising the steps 
of: 

a. computing or storing the computed value of t=phi(n), where phi is 
Euler's totient function of the modulus n; 

b. selecting some secret integer i; and 

c. replacing the computation of x~d (mod n) by the computation of 
x~(d+i*t) (mod n). 

2- In the method of claim 1, the improvement where i is chosen as a 
random number in the range [0, (2~k)-l] for some k. 

3, In the method of claim 2, the improvement where k=32. 

4. In a method of implementing public key schemes containing the 
CRT form of the modular exponentiation operation x~d (mod n) where n=p*q, for the 
purpose of making them more resistant to timing and fault attacks, the improvement 
comprising the steps of: 

a. selecting some secret integer j; 

b. computing v_l=x (mod j*p), v_2=x (mod j*q), d_l=d (mod 
phi(j*p)), d_2=d (mod phi(pq)), w_l=v_l~d_l (mod j*p), and 
w_2=v_2~d_2 (mod j*q); 

c. aborting the computation if w_l and w_2 are not equal modulo j; 
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otherwise, computing y_l =w_l (mod p), y_2=w_2 (mod q), and 
combining them by the Chinese Remainder Theorem to obtain the 
result of x^d (mod n). 

In the method of claim 4 improvement where j is chosen as a prime 



number. 



6. In the method of claim 4 improvement where j is chosen as a 
random number in the range [0,(2 ^k)-l] for some k. 

7. In the method of claim 5 improvement where k=32. 

8. In a method of implementing public key schemes containing the 
CRT form of the modular exponentiation operation x^d (mod n) where n=p*q, for the 
purpose of making them more resistant to timing and fault attacks, the improvement 
comprising the steps of: 

a. selecting two secret integers j_l and j_2; 

b. computing v_l=x (mod j_l*p), v_2=x (mod j_2*q), d_l=d (mod 
phi(jJL*P))> d_2=d (mod phi(j_2*q)), wJ=vJ A d_l (mod j_l*p), 
and w_2=v_2~d_2 (mod j_2*q); 

c. computing v_3=x (mod j_l), v_4=x (mod j_2), d_3=d (mod j_l), 
d_4=d (mod j_2), w_3=v_3~d_3 (mod j_l), and w_4=v_4~d_4 
(mod j_2); 

d. aborting the computation if w_3 is not equal to w_l modulo j_l, or 
if w_4 is not equal to w_2 modulo j_2; d. otherwise, computing 
y_l=w_l (mod p), y_2=w_2 (mod q), and combining them by the 
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Chinese Remainder Theorem to obtain the result of x^d (mod n). 

9. In the method of claim 8 improvement where j_l and jJZ are prime 

numbers. 

10. In the method of claim 8 improvement where j_l and j_2 are chosen 
as random numbers in the range [0,(2 ^k)-l] for some k. 

11. In the method of claim 10 improvement where k=32. 

12. In apparatus for implementing public key schemes containing the 
non-CRT form of the modular exponentiation operation x^d (mod n), for the purpose 
of making them more resistant to timing attacks, the improvement comprising: 

a. means for computing or storing the computed value of t=phi(n), 
where phi is Eider's totient function of the modulus n; 

b. means for selecting some secret integer i; and 

c. means for replacing the computation of x^d (mod n) by the 
computation of x~(d+i*t) (mod n). 

13. In apparatus according to claim 1, the improvement where i is 
chosen as a random number in the range [0, (2~k)-l] for some k. 

14. In apparatus according to claim 2, the improvement where k=32. 

15. In apparatus for implementing public key schemes containing the 
CRT form of the modular exponentiation operation x^d (mod n) where n=p*q, for the 
purpose of making them more resistant to timing and fault attacks, the improvement 
comprising: 

a. means for selecting some secret integer j; 
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b. 



means for computing v_l=x (mod j*p), v_2=x (mod j*q), d_l=d 



(mod phi(j*p)), d_2=d (mod phi(j*q)), w_l=v_l~d_l (mod j*p), 



and w_2=v_2~d_2 (mod j*q); 



c. 



means for aborting the computation if w_l and w_2 are not equal 



modulo j; 



d. 



otherwise, means for computing y_l =w_l (mod p), y_2=w_2 (mod 



q), and combining them by the Chinese Remainder Theorem to 
obtain the result of x^d (mod n). 

16. In apparatus according to claim 4, the improvement where j is 
chosen as a prime number. 

17. In apparatus according to claim 4, the improvement where j is 
chosen as a random number in the range [0,(2 ^k)-l] for some k. 

18. In apparatus according to claim 5 the improvement where k=32. 

19. In apparatus for implementing public key schemes containing the 
CRT form of the modular exponentiation operation x^d (mod n) where n=p^q, for the 
purpose of making them more resistant to timing and fault attacks, the improvement 
comprising: 



selecting two secret integers j_l and j_2; 



b. 



computing v_l=x (mod j_l*p), v_2=x (mod j_2*q), d_l=d (mod 



phi(j_l*p)), d_2=d (mod phi(j_2*q)), wJ=vJ A d_l (mod 



P), 



and w_2=v_2^d_2 (mod j_2*q); 



computing v_3=x (mod j_l), v_4=x (mod j_2), d_3=d (mod j_l), 
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d_4=d (mod j_2), w^v^d^ (mod j_l), and w_4=v_4~d_4 
(mod j_2); 

d. abortiBg the computation if wJ3 is not equal to w_l modulo or 
if w_4 is not equal to w_2 modulo j_2; d. otherwise, computing 
y_l=w_l (mod p), y_2=w_2 (mod q), and combining them by the 
Chinese Remainder Theorem to obtain the result of x^d (mod n). 

20. In apparatus according to claim 8, the improvement where j_l and 
j_2 are prime numbers. 

21. In apparatus according to claim 8, the improvement where j_l and 
j_2 are chosen as random numbers in the range [0,(2~k)-l] for some k. 

22. In apparatus according to claim 10, the improvement where k=32. 
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REPLACE xJ=x(mod p) AND x_2=x(mod q) 
BY v_l=x(mod j*p) AND v_2=x(mod j*q) 
d_l=d(mod phi(j*p)) AND d_2=(mod phiQ*q)) 
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